Towards speaker independent continuous speechreading

نویسنده

Juergen Luettin

چکیده

This paper describes recent speechreading experiments for a speaker independent continuous digit recognition task. Visual feature extraction is performed by a lip tracker which recovers information about the lip shape and information about the greylevel intensity around the mouth. These features are used to train visual word models using continuous density HMMs. Results show that the method generalises well to new speakers and that the recognition rate is highly variable across digits as expected due to the high visual confusability of certain words.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Linear discriminant analysis for speechreading

This paper investigates the use of Fisher-Rao linear discriminant analysis (LDA) as a means of visual feature extraction for hidden Markov model based automatic speechreading. For every video frame, a three-dimensional region of interest containing the speaker's mouth over a sequence of adjacent frames is lexicographically arranged into a data vector. Such vectors are then projected onto the sp...

متن کامل

Lipreading by Neural Networks: Visual Preprocessing, Learning, and Sensory Integration

Stanford University Stanford, CA 94305 We have developed visual preprocessing algorithms for extracting phonologically relevant features from the grayscale video image of a speaker, to provide speaker-independent inputs for an automatic lipreading ("speechreading") system. Visual features such as mouth open/closed, tongue visible/not-visible, teeth visible/notvisible, and several shape descript...

متن کامل

Tactiling: a usable support system for speechreading?

The purpose of this study was to find out whether deafened adults can take advantage of the extra information in speechreading given by the vibrational and motional patterns picked up by placing a hand on a speaker's throat and shoulder, and how valuable this tactile supplement is as a support system for speechreading. We have named this method--speechreading with tactile supplement--tactiling....

متن کامل

Speechreading Using Probabilistic Models Speechreading Using Probabilistic Models

A robust method for locating and tracking lips in gray level image sequences is described The method learns patterns of shape variability from a training set which constrains the model during image search to only deform in ways similar to the training examples Image search is guided by a learned gray level model which is used to describe the large appearance variability of lips Such variability...

متن کامل

Speechreading using shape and intensity information

We describe a speechreading system that uses both, shape information from the lip contours and intensity information from the mouth area. Shape information is obtained by tracking and parameterising the inner and outer lip boundary in an image sequence. Intensity information is extracted from a grey level model, based on principal component analysis. In comparison to other approaches, the inten...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 1997

Towards speaker independent continuous speechreading

نویسنده

چکیده

منابع مشابه

Linear discriminant analysis for speechreading

Lipreading by Neural Networks: Visual Preprocessing, Learning, and Sensory Integration

Tactiling: a usable support system for speechreading?

Speechreading Using Probabilistic Models Speechreading Using Probabilistic Models

Speechreading using shape and intensity information

عنوان ژورنال:

اشتراک گذاری